Toward Predictive Chemical Deformulation Enabled by Deep Generative Neural Networks

نویسندگان

چکیده

The design of chemical formulations is a challenging, high-dimensional problem. In typical formulations, tens thousands ingredients are available for use, yet only tiny fraction end up in given formulation. Deformulation, the problem reverse engineering precise amounts each ingredient starting from just list ingredients, similarly challenging but key capability staying up-to-date with industry competitors. Here, we take advantage large, curated dataset CAS, division American Chemical Society, which offers consistent and highly structured representation identities their components to show that variational autoencoder neural network learns meaningful representations various product classes such as antiperspirants oral care. Furthermore, it can be used conjunction two-step sampling algorithm generate accurate amount suggestions deformulation. Deformulation using produces estimates significantly more than nearest neighbor methods, extrapolates better different previously seen provides way leverage large datasets industrially relevant capabilities.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Face Deidentification with Generative Deep Neural Networks

Face deidentification is an active topic amongst privacy and security researchers. Early deidentification methods relying on image blurring or pixelization were replaced in recent years with techniques based on formal anonymity models that provide privacy guaranties and at the same time aim at retaining certain characteristics of the data even after deidentification. The latter aspect is partic...

متن کامل

Deep Generative Stochastic Networks Trainable by Backprop

We introduce a novel training principle for probabilistic models that is an alternative to maximum likelihood. The proposed Generative Stochastic Networks (GSN) framework is based on learning the transition operator of a Markov chain whose stationary distribution estimates the data distribution. The transition distribution of the Markov chain is conditional on the previous state, generally invo...

متن کامل

Generative Deep Neural Networks for Dialogue: A Short Review

Researchers have recently started investigating deep neural networks for dialogue applications. In particular, generative sequence-to-sequence (Seq2Seq) models have shown promising results for unstructured tasks, such as word-level dialogue response generation. The hope is that such models will be able to leverage massive amounts of data to learn meaningful natural language representations and ...

متن کامل

Privacy-preserving generative deep neural networks support clinical data sharing

Though it is widely recognized that data sharing enables faster scientific progress, the sensible need to protect participant privacy hampers this practice in medicine. We train deep neural networks that generate synthetic subjects closely resembling study participants. Using the SPRINT trial as an example, we show that machine-learning models built from simulated participants generalize to the...

متن کامل

Generative learning for deep networks

Learning, taking into account full distribution of the data, referred to as generative, is not feasible with deep neural networks (DNNs) because they model only the conditional distribution of the outputs given the inputs. Current solutions are either based on joint probability models facing difficult estimation problems or learn two separate networks, mapping inputs to outputs (recognition) an...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Industrial & Engineering Chemistry Research

سال: 2021

ISSN: ['0888-5885', '1520-5045']

DOI: https://doi.org/10.1021/acs.iecr.1c00634